Audio-Oriented Multimodal Machine Comprehension via Dynamic Inter- and Intra-modality Attention
Authors
Abstract
While Machine Comprehension (MC) has attracted extensive research interest in recent years, existing approaches mainly belong to the category of the Machine Reading Comprehension task, which mines textual inputs (paragraphs and questions) to predict the answers (choices or text spans). However, there are many MC tasks that accept audio input in addition to the textual input, e.g., English listening comprehension tests. In this paper, we target the problem of Audio-Oriented Multimodal Machine Comprehension, whose goal is to answer questions based on the given audio and textual information. To solve this problem, we propose a Dynamic Inter- and Intra-modality Attention (DIIA) model to effectively fuse the two modalities (audio and textual). DIIA can work as an independent component and thus be easily integrated into existing MC models. Moreover, we further develop a Multimodal Knowledge Distillation (MKD) module to enable our multimodal MC model to accurately predict the answers based only on either the text or the audio. As a result, the proposed approach can handle various tasks, including Audio-Oriented Multimodal Machine Comprehension, Machine Reading Comprehension, and Machine Listening Comprehension, in a single model, making fair comparisons possible between our model and the existing unimodal MC models. Experimental results and analysis prove the effectiveness of the proposed approaches. First, the proposed DIIA boosts the baseline models by up to 21.08% in terms of accuracy. Second, under the unimodal scenarios, the MKD module allows our multimodal MC model to significantly outperform the unimodal models, which are trained and tested with only audio or textual data, by up to 18.87%.
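As a rough illustration of what fusing two modalities with intra- and inter-modality attention can look like, the sketch below applies plain scaled dot-product attention twice: once within the text (intra) and once from the text to the audio (inter), then concatenates the two views. This is a generic toy example, not the authors' DIIA model: the function names (`attend`, `fuse`), the toy vectors, and the concatenation step are all assumptions made for illustration, and the "dynamic" weighting described in the abstract is not reproduced.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def attend(queries, keys, values):
    """Scaled dot-product attention: each query receives a weighted mix of values."""
    d = len(keys[0])
    out = []
    for q in queries:
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys]
        weights = softmax(scores)
        out.append([
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ])
    return out


def fuse(audio, text):
    """Hypothetical fusion: concatenate intra- and inter-modality views per text position."""
    intra = attend(text, text, text)    # text attends to text
    inter = attend(text, audio, audio)  # text attends to audio
    return [a + b for a, b in zip(intra, inter)]


# Toy inputs (assumed): 3 audio frames and 2 text tokens, each a 2-d vector.
audio = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
text = [[0.5, 0.5], [1.0, 0.0]]
fused = fuse(audio, text)
```

Each fused row has twice the input dimension: the first half summarizes the text itself, the second half summarizes the audio as seen from that text position.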
Similar resources
Modality management for multimodal human-machine interfaces
Synergistic multimodal human-machine interfaces are characterised by their ability to interpret user input from more than one input modality. Such interfaces may contribute to better driver information systems in terms of efficiency and comfort of use. In this article we present an approach for the integration of voice and touchscreen input as well as capacitive proximity sensing for two scenari...
Sustained Spatial Attention in Touch: Modality-Specific and Multimodal Mechanisms
Sustained attention to a body location results in enhanced processing of tactile stimuli presented at that location compared to another unattended location. In this paper, we review studies investigating the neural correlates of sustained spatial attention in touch. These studies consistently show that activity within modality-specific somatosensory areas (SI and SII) is modulated by sustained ...
Bidirectional Attention Flow for Machine Comprehension
Machine comprehension (MC), answering a query about a given context paragraph, requires modeling complex interactions between the context and the query. Recently, attention mechanisms have been successfully extended to MC. Typically these methods use attention to focus on a small portion of the context and summarize it with a fixed-size vector, couple attentions temporally, and/or often form a ...
DYNAMIC COMPLEXITY OF A THREE SPECIES COMPETITIVE FOOD CHAIN MODEL WITH INTER AND INTRA SPECIFIC COMPETITIONS
The present article deals with the inter specific competition and intra-specific competition among predator populations of a prey-dependent three component food chain model consisting of two competitive predator sharing one prey species as their food. The behaviour of the system near the biologically feasible equilibria is thoroughly analyzed. Boundedness and dissipativeness of the system are e...
Attention-based Multimodal Neural Machine Translation
We present a novel neural machine translation (NMT) architecture associating visual and textual features for translation tasks with multiple modalities. Transformed global and regional visual features are concatenated with text to form attendable sequences which are dissipated over parallel long short-term memory (LSTM) threads to assist the encoder generating a representation for attention-bas...
Journal
Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence
Year: 2021
ISSN: 2159-5399, 2374-3468
DOI: https://doi.org/10.1609/aaai.v35i14.17548